Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications
Authors
Abstract
Egocentric videos offer fine-grained information for high-fidelity modeling of human behaviors. Hands and interacting objects are one crucial aspect of understanding a viewer’s behaviors and intentions. We provide a labeled dataset consisting of 11,243 egocentric images with per-pixel segmentation labels of hands and objects being interacted with during a diverse array of daily activities. Our dataset is the first to label detailed hand-object contact boundaries. We introduce a context-aware compositional data augmentation technique to adapt to out-of-distribution YouTube egocentric video. We show that our robust hand-object segmentation model and dataset can serve as a foundational tool to boost or enable several downstream vision applications, including hand state classification, video activity recognition, 3D mesh reconstruction of hand-object interactions, and video inpainting of hand-object foregrounds in egocentric videos. Dataset and code are available at: https://github.com/owenzlz/EgoHOS.
Similar resources
Left/right hand segmentation in egocentric videos
Wearable cameras allow people to record their daily activities from a user-centered (First Person Vision) perspective. Due to their favorable location, wearable cameras frequently capture the hands of the user, and may thus represent a promising user-machine interaction tool for different applications. Existent First Person Vision methods handle hand segmentation as a background-foreground prob...
Gesture-based Bootstrapping for Egocentric Hand Segmentation
Accurately identifying hands in images is a key subtask for human activity understanding with wearable first-person point-of-view cameras. Traditional hand segmentation approaches rely on a large corpus of manually labeled data to generate robust hand detectors. However, these approaches still face challenges as the appearance of the hand varies greatly across users, tasks, environments or illum...
Fine-Grained Concurrent Object-Oriented
The introduction of concurrency complicates the already difficult task of large-scale programming. Concurrent object-oriented languages provide a mechanism, encapsulation, for managing the increased complexity of large-scale concurrent programs, thereby reducing the difficulty of large-scale concurrent programming. In particular, fine-grained object-oriented approaches provide modularity through enca...
IAIR-CarPed: A psychophysically annotated dataset with fine-grained and layered semantic labels for object recognition
doi:10.1016/j.patrec.2011.10.003. Unlike...
Fine-Grained and Layered Object Recognition
This paper presents a novel research on promoting the performance and enriching the functionalities of object recognition. Instead of simply fitting various data to a few predefined semantic object categories, we propose to generate proper results for different object instances based on their actual visual appearances. The results can be fine-grained and layered categorization along with absolute or ...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2022
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-031-19818-2_8